Goto

Collaborating Authors

 Energy Law


GLIDER: Grading LLM Interactions and Decisions using Explainable Ranking

arXiv.org Artificial Intelligence

The LLM-as-judge paradigm is increasingly being adopted for automated evaluation of model outputs. While LLM judges have shown promise on constrained evaluation tasks, closed source LLMs display critical shortcomings when deployed in real world applications due to challenges of fine grained metrics and explainability, while task specific evaluation models lack cross-domain generalization. We introduce GLIDER, a powerful 3B evaluator LLM that can score any text input and associated context on arbitrary user defined criteria. GLIDER shows higher Pearson's correlation than GPT-4o on FLASK and greatly outperforms prior evaluation models, achieving comparable performance to LLMs 17x its size. GLIDER supports fine-grained scoring, multilingual reasoning, span highlighting and was trained on 685 domains and 183 criteria. Extensive qualitative analysis shows that GLIDER scores are highly correlated with human judgments, with 91.3% human agreement. We have open-sourced GLIDER to facilitate future research.


From gymnastics to virtual nonholonomic constraints: energy injection, dissipation, and regulation for the acrobot

arXiv.org Artificial Intelligence

In this article we study virtual nonholonomic constraints, which are relations between the generalized coordinates and momenta of a mechanical system that can be enforced via feedback control. We design a constraint which emulates gymnastics giant motion in an acrobot, and prove that this constraint can inject or dissipate energy based on the sign of a design parameter. The proposed constraint is tested both in simulation and experimentally on a real-world acrobot, demonstrating highly effective energy regulation properties and robustness to a variety of disturbances.


Senate to grapple with AI's effect on US energy as regulation talks heat up

FOX News

Fox News correspondent Gillian Turner has the latest on the president's focus amid calls for an impeachment inquiry on'Special Report.' The top Republican on the Senate Energy Committee will warn Thursday against allowing U.S. artificial intelligence capabilities to fall into China's hands when the panel meets for a hearing on the topic. Senators returned to Capitol Hill just days ago after spending the month of August in their home states. AI is expected to be a prominent topic for lawmakers as they race to get ahead of the rapidly advancing technology. It's also the topic at the heart of Thursday's hearing led by Energy Committee Chair Joe Manchin, D-W.Va., and ranking member John Barrasso, R-Wyo., that aims to examine how AI has affected the U.S. energy sector and how the federal government can stay competitive in that lane.